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Abstract. We investigate the statistical properties of the 
APM-Stromlo redshift survey by using the concepts and meth- 
ods of modern Statistical Physics. We find that galaxy distri- 
bution in this survey exhibits scale invariant properties with 
fractal dimension D = 2.1 ± 0.1, up to ~ AQh~^ Mpc, i.e. the 
limit of its statistical validity, without any tendency towards 
homogenization. No intrinsic characteristic scales are definitely 
found in this galaxy sample. We present several tests to study 
the statistical reliability of the results. 



1. Introduction 

In the debate about the statistical properties of the available 
three dimensional galaxy samples (see Pietronero et a/. 1997 
and Davis 1997 for the two different points of view on the 
subject) it is often claimed (see e.g. Peebles, 1980) that there 
exists a well defined correlation length ro which separates a 
correlated regime (r < ro) from an uncorrelated one (r > ro). 
Such a characteristic length has been identified by the usual 
statistical analysis of spatial galaxy distribution through the 
two points correlation function ^(r) (Peebles, 1980), and it 
is defined as ^(ro) = 1. Although the standard value is es- 
timated to be ro « 5h~^Mpc (Davis & Peebles, 1983; Davis, 
1997) different authors have measured a value for ro larger than 
Wh'^Mpc (e.g. Benoist et ai, 1996; Park et al, 1994), the rea- 
son of such a disagreement being attributed to a "luminosity 
bias": galaxies of different intrinsic luminosities should have 
different correlation lengths. While Davis et a/.(1988) found a 
quantitative result for the shift of ro as a function of absolute 
magnitude, other authors (e.g. Benoist et al., 1996; Park et at, 
1994) neither have confirmed such a behavior nor have given 
an alternative quantitative explanation for such an effect. 

Some years ago we criticized this approach and proposed 
a new one based on the concepts and methods of modern Sta- 
tistical Physics (Pietronero 1987; Coleman & Pietronero 1992; 
Pietronero et al., 1997; Sylos Labini et al., 1997). By using this 
more general framework, we present in this paper the analy- 
sis of the Stromlo-APM redshift survey (Loveday et al., 1992; 
Loveday et al., 1996). 

The main result is that galaxy distribution in this survey 
does not exhibit any characteristic length scale and, on the 
contrary, we find that it is characterized by having scale in- 
variant properties with fractal dimension D = 2.1 ± 0.1 up to 
the statistical limit of this galaxy sample (i.e. ~ 40/i~^A/pc). 



The very first consequence of this result is that the usual 
statistical methods (as for example £,{r)), based on the assump- 
tion of homogeneity, are therefore inconsistent for the analysis 
of irregular distribution such as the one present in this sample: 
unless a well defined cut-off towards homogenization has been 
identified, all the results based on the ^(r), such as the so-called 
luminosity segregation effect, are artifacts. This is the origin 
of the confusing statements about "luminosity segregation". In 
Sylos Labini & Pietronero (1996) we have clarified this point by 
showing that galaxies of different morphology indeed have dif- 
ferent clustering properties (for example ellipticals are mainly 
in the core of rich galaxy clusters, while spirals are in the field), 
but this fact has no relation with the increasing of ro found in 
galaxy catalogs (Sylos Labini et al., 1997). The segregation of 
galaxies with different morphology is, on the contrary, related 
to the multifractal nature of galaxy distribution, i.e. to the 
correlation between galaxy positions and luminosities. 

Our results are therefore in contrast with those of Love- 
day et aZ.1996, where a well defined correlation length ro ~ 
5h~^Mpc has been identified. We clarify the reason of such a 
disagreement, and we present various tests to asset the robust- 
ness of the present analysis. 

2. The sample 

The Stromlo-APM Redshift Survey (SARS) consists of 1797 
galaxies with bj < 17.15 selected randomly at a rate of 1 
in 20 from APM scans (Loveday et al., 1992; Loveday et al., 
1996). The survey covers a solid angle of = 1.3sr in the 
south galactic hemisphere, delimited by 21'' ^ a <■ 5'' and 
—72.5° ^ 5 <^ —17.5°. An important selection effects exists: 
galaxies with apparent magnitude brighter than bj = 14.5 are 
not included in the sample because of photographic saturation 
(see Fig.^). Moreover in a magnitude limited sample another 
important selection effect must be considered: at every distance 
in the apparent magnitude limited survey, there is a definite 
limit in intrinsic luminosity which is the absolute magnitude 
of the faintest galaxy visible that distance. Hence at large dis- 
tances, intrinsically faint objects are not observed whereas at 
smaller distances they are observed. In order to analyze the 
statistical properties of galaxy distribution without introduc- 
ing any a priori assumption, a catalog without this selection 
effect must be used. In general, it exists a very well known 
procedure to obtain a sample unbiased by this luminosity se- 
lection effect: this is the so-called "volume limited" (VL) sam- 
ple. A VL sample contains every galaxy in the volume which 
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Sample 


ARvL 






N 


{() 


WL12 


100 ^ 200 


-19.35 ^ 


- -20.5 


451 


7.3 


WL51 


50 ^ 100 


-17.85^ 


- -19.0 


170 


4.6 


WL153 


150 ~ 300 


-20.2 ~ 


-21.38 


230 


14.5 



Table 2. The VL subsamples of the APM catalog with a dou- 
ble cut in distance di.Rv L{h~^ Mpc) and absolute magnitude 



Fig. 1. Absolute magnitude versus distance diagram for the SARS 
catalog. There are shown the two limiting function with apparent 
magnitude m}. = 14.5 and m?. = 17.15. The survey is complete 
up to ~ 17 — th magnitude and does not contain galaxies brighter 
than ~ 14.5. 



The measured velocities of the galaxies have been expressed 
in the preferred frame of the Cosmic Microwave Background 
Radiation (CMBR), i.e. the hehocentric velocities of galaxies 
have been corrected for the solar motion with respect to the 
CMBR, according with the formula v = Vm + 316cos0 kms~^ 
where i; is the corrected velocity, Vm is the observed veloc- 
ity and 9 is the angle between the observed velocity and the 
direction of the CMBR dipole anisotropy (a = 169.5° and 
6 = —7.5°). From these corrected velocities, we have calcu- 
lated the comoving distances r(z), with for example go = 0.5, 
by using the Mattig's relation (see e.g. Park et al, 1994) 

r{z) = 6000 I 1 , ^ I h~^Mpc . (1) 

It is important to stress that the analyses presented here 
have been performed in redshift space. We have not applied 
any correction to take into account the effect of peculiar ve- 
locity distortions. However we expect that these corrections 
are negligible on scales larger than ^ 10h~^ Mpc, due to the 
fact that the amplitude of peculiar motions is not larger than 
~ lOOOfcms"^ 



is more luminous than a certain limit, so that in such a sample 
there is no incompleteness for any observational luminosity se- 
lection effect (see for example Davis & Peebles, 1983; Coleman 
& Pietronero, 1992). 

In order to construct VL samples we have adopted two dif- 
ferent procedures. The first is the standard one, i.e. we have 
introduced an upper cutoff in the distance and computed the 
corresponding cutoff in absolute magnitude. The characteris- 
tics of these samples are reported in Tab.hl 



Sample 


RvL{h-'Mpc) 




N 


{£){h-'Mpc) 


VL18 


107 


-18.0 


325 


4.2 


VL19 


170 


-19.0 


481 


6.2 


VL20 


269 


-20.0 


403 


11 



Table 1. The VL subsamples of the APM catalog : Rvl is the 
distance corresponding to the absolute magnitude limit of the 
sample Mum\ N is the number of points contained and {£) is 
the average distance between neighbor galaxies. 



The second procedure consists by putting two limits in dis- 
tances and compute the corresponding two limits in absolute 
magnitude (see TabJ^ where WL12 means the VL sample lim- 
ited by the distance range 100 200h~^ Mpc). In such a way 
we can avoid the selection effects due to the fact that in this 
survey are not included galaxies brighter than 14.5. 



3. The Average Conditional Density 

We start by recalling the concept of correlation. If the presence 
of an object at the point ri influences the probability of finding 
another object at r-2, these two points are correlated. Therefore 
there is a correlation at r if, on average 

G(r) = (n(O)n(r)) / (n> (2) 

where we average on all occupied points chosen as origin. 
On the other hand, there is no correlation if 

G(r) ^ {nf . (3) 

The physically meaningful definition of the homogeneity 
scale Ao is therefore the length scale which separates correlated 
regimes from uncorrelated ones. 

In practice, it is useful to normalize the correlation function 
(CF) to the size of the sample analyzed. Then we use, following 
Coleman & Pietronero (1992) 

^ ^ < n(r)n(0) > ^ G{r) 

<n> <n> ^ ' 

where < n > is the average density of the sample. We 
stress that this normalization does not introduce any bias even 
if the average density is sample-depth dependent, as in the case 
of fractal distributions, because it represents only an overall 
normalizing factor. In order to compare results from different 
catalogs it is however more useful to use r(r), in which the 
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size of a catalog only appears via the combination A'^"^ Si^i' 
so that a larger sample volume only enlarges the statistical 
sample over which averages are taken. On the contrary G(r) 
has an amplitude that is an explicit function of the sample's 
size scale. 

r(r) measures the average density within a spherical shell 
of thickness Ar at distance r from an occupied point at ri, 
and it is called the conditional average density (Coleman & 
Pietronero, 1992). Such a function can can be estimated by 

where D is the fractal dimension and the prefactor B is in- 
stead related to the lower cut-offs, i.e. the lower scale at which 
the self-similarity is broken (Coleman & Pietronero, 1992; Sy- 
los Labini et al, 1997). The quantity N{r) in eqj^ is the actual 
number of points over which the average is done. This is in gen- 
eral a function of the radius r of the shell, because we consider 
only those points for which the spherical shell of radius r in en- 
tirely contained in the sample. Clearly as r grows the number 
of points N{r) decreases, and for this reason the determination 
of r(r) for large r is more noisy (see below). 

If the distribution is fractal up to a certain distance Aq, and 
then it becomes homogeneous, T{r) has a power law decaying 
with distance up to Aq, and then it flattens towards a constant 
value. Hence by studying the behavior of r(r) it is possible to 
detect the eventual scale-invariant versus homogeneous prop- 
erties of the sample. 

In general, it is also very useful to use the integrated con- 
ditional density 

r*(r) = ^ r A7vr'^r{r')dr' = —r''-^ (6) 
inr^ An 

This function produces an artificial smoothing of rapidly 
varying fluctuations, but it correctly reproduces global prop- 
erties (Coleman & Pietronero, 1992). For a fractal structure, 
r(r) has a power law behavior and the integrated conditional 
density is 

r*(r) = -|r(r). (7) 

For an homogeneous distribution {D = 3) these two func- 
tions are exactly the same and equal to the average density. 

Contrary to the conditional density, the information given 
by the ^(r) function is biased by the a priori (untested) assump- 
tion of homogeneity. Pietronero and collaborators (Pietronero, 
1987; Coleman & Pietronero, 1992; Sylos Labini et ai, 1997) 
have clarified some crucial points of the standard correlations 
analysis, and in particular they have discussed the physical 
meaning of the so-called "correlation length" ro found with 
the standard approach and defined by the relation ^(ro) = 1 
where 

= <n(ro)n(ro + r)> _ ^ 
^ ' < n >2 ^ ' 

is the two points correlation function used in the standard 
analysis. The basic point in the present discussion, is that the 
mean density (n) used in the normalization of (,{r), is not 
a well defined quantity in the case of self-similar distribution 



and it is a direct function of the sample size. Hence only in 
the case that homogeneity has been reached well within the 
sample limits the ^(r)-analysis is meaningful, otherwise the a 
priori assumption of homogeneity is incorrect and characteris- 
tic lengths, like ro, became spurious (see e.g. Sylos Labini et 
ai, 1997 for an exhaustive discussion of the matter). 

Given a certain spherical sample with solid angle and 
depth Rs , it is important to define which is the maximum dis- 
tance up to which it is possible to compute the correlation 
function (r(r) or (,{r)). As discussed in Coleman & Pietronero 
(1992), we have limited our analysis to an effective depth Reff 
that is of the order of the radius of the maximum sphere fully 
contained in the sample volume. The reason why r(r) (or ^(r)) 
cannot be computed for r > Reff is essentially the follow- 
ing. When one evaluates the correlation function (or power 
spectrum - see Sylos Labini & Amendola 1996) beyond Reff, 
then one makes explicit assumptions on what lies beyond the 
sample's boundary. In fact, even in absence of corrections for 
selection effects, one is forced to consider incomplete shells cal- 
culating r(r) for r > Reff, thereby implicitly assuming that 
what one does not find in the part of the shell not included in 
the sample is equal to what is inside. 

The maximum depth of a reliable statistical analysis, is lim- 
ited by the radius of the sample (as previously discussed), while 
the minimum distance depends on the number of points con- 
tained in the volume and on the fractal dimension. For a Pois- 
son distribution the mean average distance between nearest 
neighbors is of the order {£) ~ (V/N)'^ . It is possible to com- 
pute the average distance between neighbor galaxies {£), in a 
fractal distribution with dimension D, and the result is 

»^(i)*r(i4) (9) 

where F is the Euler's gamma-function (Sylos Labini et al., 
1997b). Clearly this quantity is related to the lower cut-off of 
the distribution B (eq.^) and to the fractal dimension D. If 
we measure the conditional density at distances r < {£), we 
are affected by a finite size effect. In fact, due the depletion of 
points at these distances we underestimate the real conditional 
density finding an higher value for the correlation exponent 
(and hence a lower value for the fractal dimension). In the 
limiting case at the distance r <g {£), we can find almost no 
points and the slope is 7 = —3 (D = 0). In general, when one 
measures F(r) at distances which correspond to a fraction of 
{£), one finds systematically an higher value of the conditional 
density exponent. Such a trend is completely spurious and due 
to the depletion of points at such distances. It is worth to notice 
that this effect gives rise to a curved behaviour of F* (r) (eq.^ 
at small distances, because of its integral nature. 

An important point in the study of galaxy distribution, is 
that galaxies are characterized by having very different intrin- 
sic luminosities. In order to take into account this effect, in 
what follows we assume that 

KL,r) = 0(L)F(r) , (10) 

i.e. that the number of galaxies for unit luminosity and 
volume u{L,r) can be expressed as the product of the space 
density r(r) (eqj^ and the luminosity function 4'{L) {L is the 
intrinsic luminosity). This is a crude approximation in view 
of the multifractal properties of the distribution (correlation 
between position and luminosity). However, for the purpose of 
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the present discussion, the previous approximation is rather 
good and the explicit consideration of the multifractal prop- 
erties have a minor effect on the properties discussed (Sylos 
Labini & Pietronero, 1996). 

In view of eq.p^, to each VL sample (limited by the absolute 
magnitude Mvl) we can associate the luminosity factor 

ct>{M)dM (11) 

- oo 

that gives the fraction of galaxies for unit volume, present 
in the sample. Hereafter we adopt the following normalization 
for the luminosity function 

(f>{M)dM = 1 (12) 

-oo 

where Mmin ~ —10-: — 12 is the fainter galaxy present in 
the available samples. The luminosity factor of Eq.llll is useful 
to normalize the space density in different VL samples which 
have different Mvl- 

4. Three dimensional properties 

We have computed the conditional average density for the VL 
samples with only one cut in absolute magnitude, and we show 
in FigJd the results. The fractal dimension is D = 2.1±0.1 up to 



regimes: the first one at small distances r < 2-{r6h~^ Mpc shows 
a fluctuating nature and in general a more steeper decay. This 
is due to the finite size effect discussed previously: at these 
scales we are in the limit r < {£) as it results from Tab.^. Then, 
at large scale, r*(r) shows a well defined power law decay in 
the range 2 H- Gh'^Mpc < r < 30 H- iOh'^Mpc in the different 
samples. The last few points are a noisy because the number 
of points N{r) in the average of eq.^ is rather small. 

We have evaluate the statistical errors corresponding to 
the density measurements. The standard method for comput- 
ing the correlation function errors is through the technique of 
bootstrap resampling (Ling et ai, 1986). We have generated a 
series of A'^ = 100 bootstrap data sets of the same size of the 
original data set by randomly choosing the bootstrap galaxies 
from the original sample. Each bootstrap sample contains 100 
randomly selected galaxies. 

In general for r < {£) the signal is rather noisy and the 
bootstrap errors are large. We have eliminated those points, 
at small scale, for which the statistical error is larger than 
the signal itself. Moreover we have considered the determina- 
tion of the conditional density only if the average is performed 
over more than 20 points. This is because, especially at large 
scales, there are very few points which contribute to the aver- 
age. Moreover the statistical errors have been computed from 
those points, which are in the center of the sample: such a sit- 
uation may introduce a systematic effect which may perturb 
the behaviour of the conditional density at large scale. 

In order to check whether the luminosity incompleteness of 
the sample for apparent magnitude brighter than 14.5 affects 
substantially the trends found in Fig.^ we have computed the 
conditional average density for the VL samples, with two cuts 
in distance and two in absolute magnitude. These samples are 
defined by a lower and an upper cut in distance Even in this 
case (FigJsf) the fractal dimension turns out to be D = 2.2 ±0.2 
up to Reff ~ 20h~^ Mpc. This agreement is essentially due to 
the fact that the galaxies with apparent magnitude brighter 
than 14.5 represent a small fraction of the total sample, and 
do not perturb the final result. Clearly in this case the effective 
volume is smaller and so Reff- 

As long as the space and the luminosity density can be con- 
sidered independent, the normalization of T{r) in different VL 
samples can be simply done by dividing its amplitude for the 
corresponding luminosity factor. Of course such a normaliza- 
tion is parametric, because it depends on the two parameters 
of the luminosity function 5 and M* For a reasonable choice of 
these two parameters we find that in different VL samples the 
amplitude of the conditional density matches quite well (see 
Fig.^) For the SARS catalog the parameters of the luminosity 
function are 5 — —1.1 and M* = —19.5 (Loveday et ai, 1996). 



Fig. 2. The redshift space conditional average density computed for 
some VL sample of the SARS redshift survey. The fractal dimension 
is -D = 2.1±0.1, depending on the VL sample used. We have evaluate 
the errors corresponding to the density measurements through the 
technique of bootstrap resampling (see text) 

Reff ~ 40ft~^A/pc. In Figj^ we may recognize three different 



5. Discussion and conclusions 

In order to understand the reason of the disagreement of our re- 
sults with those of Loveday et aL(1996), it is useful to consider 
the behavior of the standard correlation function for a fractal 
distribution. Following Pietronero (1987), the expression of the 
5(r) (eqj^ in the case of fractal distribution, is 

CM = ((3-7)/3)(r/i?.)-^-l (13) 

where Rs (the effective sample radius) is the radius of 
the spherical volume where one computes the average density. 
From EqO it follows that 
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Fig. 3. The redshift space conditional average density computed 
for some VL sample of the SARS redshift survey, with two cuts in 
distance and absolute magnitude. Such a procedure avoids the lumi- 
nosity incompleteness of galaxies with apparent magnitude brighter 
than 14.5. The fractal dimension is D = 2.1 it 0.1. We have evalu- 
ate the statistical errors corresponding to the density measurements 
through the technique of bootstrap resampling (see text) 

i. ) the so-called correlation length vq (defined as (,{ro) — 1) 
is a linear function of the sample size Rs 

ro = {{3-'Y)/6)^Rs (14) 

and hence it is a quantity simply related to the sample 
size. Eq.^ explains the result of Loveday et aL(1996) for ro. 
A minor discrepancy is due to the using of weighting schemes 
and the treatment of boundary conditions. 

ii. ) 5(r) is a power law only for 

((3-7)/3)(r/i?,)--»l (15) 

hence for r < ro: for larger distances there is a clear devi- 
ation from a power law behavior due to the definition of ^{r). 
This deviation, however, is just due to the size of the obser- 
vational sample and does not correspond to any real change 
of the correlation properties. It is clear that if one estimates 
the exponent of ^(r) at distances r < ro, one systematically 
obtains a higher value of the correlation exponent due to the 
break of ^(r) in the log- log plot. This is actually the case for 
the analyses performed so far: in fact, usually, ^(r) is fitted 
with a power law in the range 0.5ro ^ r < 2ro. In this case 
one obtains a systematically higher value of the correlation ex- 
ponent. In particular, the usual estimation of this exponent 
by the ^(r) function leads to is 7 ~ 1.7, different from 7 ~ 1 
(corresponding to D « 2) that we found by means of the r(r) 
analysis (see Sylos Labini et al, 1997 for more details). 



Fig. 4. The redshift space spatial conditional average density r*(r) 
computed in some VL samples of the APM-Stromlo and normalized 
to the luminosity factor. 

Our conclusion is therefore that the usual methods of anal- 
ysis are intrinsically inconsistent with respect to the proper- 
ties of this galaxy sample. The correct statistical analysis of 
the experimental data, performed with the methods of mod- 
ern Statistical Physics, shows that the distribution of galaxies 
is fractal up to the limit of the SARS sample. These methods, 
which are able to identify self-similar and non-analytical prop- 
erties, allow us to test the usual homogeneity assumption of 
luminous matter distribution. The result is that galaxy distri- 
bution in this sample is fractal with D = 2.1 ± 0.1, and this 
is in agreement with the analyses of different samples (see e.g. 
Montuori et al, 1997). These results have a number of theo- 
retical implications which are discussed in more detail in Sylos 
Labini et a/. 1997. 
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